22 research outputs found
The Universe at Extreme Scale: Multi-Petaflop Sky Simulation on the BG/Q
Remarkable observational advances have established a compelling
cross-validated model of the Universe. Yet, two key pillars of this model --
dark matter and dark energy -- remain mysterious. Sky surveys that map billions
of galaxies to explore the `Dark Universe', demand a corresponding
extreme-scale simulation capability; the HACC (Hybrid/Hardware Accelerated
Cosmology Code) framework has been designed to deliver this level of
performance now, and into the future. With its novel algorithmic structure,
HACC allows flexible tuning across diverse architectures, including accelerated
and multi-core systems.
On the IBM BG/Q, HACC attains unprecedented scalable performance -- currently
13.94 PFlops at 69.2% of peak and 90% parallel efficiency on 1,572,864 cores
with an equal number of MPI ranks, and a concurrency of 6.3 million. This level
of performance was achieved at extreme problem sizes, including a benchmark run
with more than 3.6 trillion particles, significantly larger than any
cosmological simulation yet performed.Comment: 11 pages, 11 figures, final version of paper for talk presented at
SC1
Analyzing and Visualizing Cosmological Simulations with ParaView
The advent of large cosmological sky surveys - ushering in the era of
precision cosmology - has been accompanied by ever larger cosmological
simulations. The analysis of these simulations, which currently encompass tens
of billions of particles and up to trillion particles in the near future, is
often as daunting as carrying out the simulations in the first place.
Therefore, the development of very efficient analysis tools combining
qualitative and quantitative capabilities is a matter of some urgency. In this
paper we introduce new analysis features implemented within ParaView, a
parallel, open-source visualization toolkit, to analyze large N-body
simulations. The new features include particle readers and a very efficient
halo finder which identifies friends-of-friends halos and determines common
halo properties. In combination with many other functionalities already
existing within ParaView, such as histogram routines or interfaces to Python,
this enhanced version enables fast, interactive, and convenient analyses of
large cosmological simulations. In addition, development paths are available
for future extensions.Comment: 9 pages, 8 figure
Particle mesh simulations of the Lyman-alpha forest and the signature of Baryon Acoustic Oscillations in the intergalactic medium
We present a set of ultra-large particle-mesh simulations of the LyA forest
targeted at understanding the imprint of baryon acoustic oscillations (BAO) in
the inter-galactic medium. We use 9 dark matter only simulations which can, for
the first time, simultaneously resolve the Jeans scale of the intergalactic gas
while covering the large volumes required to adequately sample the acoustic
feature. Mock absorption spectra are generated using the fluctuating
Gunn-Peterson approximation which have approximately correct flux probability
density functions (PDFs) and small-scale power spectra. On larger scales there
is clear evidence in the redshift space correlation function for an acoustic
feature, which matches a linear theory template with constant bias. These
spectra, which we make publicly available, can be used to test pipelines, plan
future experiments and model various physical effects. As an illustration we
discuss the basic properties of the acoustic signal in the forest, the scaling
of errors with noise and source number density, modified statistics to treat
mean flux evolution and misestimation, and non-gravitational sources such as
fluctuations in the photo-ionizing background and temperature fluctuations due
to HeII reionization.Comment: 11 pages, 10 figures, minor changes to address referee repor
Haloes gone MAD: The Halo-Finder Comparison Project
[abridged] We present a detailed comparison of fundamental dark matter halo
properties retrieved by a substantial number of different halo finders. These
codes span a wide range of techniques including friends-of-friends (FOF),
spherical-overdensity (SO) and phase-space based algorithms. We further
introduce a robust (and publicly available) suite of test scenarios that allows
halo finder developers to compare the performance of their codes against those
presented here. This set includes mock haloes containing various levels and
distributions of substructure at a range of resolutions as well as a
cosmological simulation of the large-scale structure of the universe. All the
halo finding codes tested could successfully recover the spatial location of
our mock haloes. They further returned lists of particles (potentially)
belonging to the object that led to coinciding values for the maximum of the
circular velocity profile and the radius where it is reached. All the finders
based in configuration space struggled to recover substructure that was located
close to the centre of the host halo and the radial dependence of the mass
recovered varies from finder to finder. Those finders based in phase space
could resolve central substructure although they found difficulties in
accurately recovering its properties. Via a resolution study we found that most
of the finders could not reliably recover substructure containing fewer than
30-40 particles. However, also here the phase space finders excelled by
resolving substructure down to 10-20 particles. By comparing the halo finders
using a high resolution cosmological volume we found that they agree remarkably
well on fundamental properties of astrophysical significance (e.g. mass,
position, velocity, and peak of the rotation curve).Comment: 27 interesting pages, 20 beautiful figures, and 4 informative tables
accepted for publication in MNRAS. The high-resolution version of the paper
as well as all the test cases and analysis can be found at the web site
http://popia.ft.uam.es/HaloesGoingMA
Haloes gone MAD: The Halo-Finder Comparison Project
We present a detailed comparison of fundamental dark matter halo properties retrieved by a substantial number of different halo finders. These codes span a wide range of techniques including friends-of-friends, spherical-overdensity and phase-space-based algorithms. We further introduce a robust (and publicly available) suite of test scenarios that allow halo finder developers to compare the performance of their codes against those presented here. This set includes mock haloes containing various levels and distributions of substructure at a range of resolutions as well as a cosmological simulation of the large-scale structure of the universe. All the halo-finding codes tested could successfully recover the spatial location of our mock haloes. They further returned lists of particles (potentially) belonging to the object that led to coinciding values for the maximum of the circular velocity profile and the radius where it is reached. All the finders based in configuration space struggled to recover substructure that was located close to the centre of the host halo, and the radial dependence of the mass recovered varies from finder to finder. Those finders based in phase space could resolve central substructure although they found difficulties in accurately recovering its properties. Through a resolution study we found that most of the finders could not reliably recover substructure containing fewer than 30-40 particles. However, also here the phase-space finders excelled by resolving substructure down to 10-20 particles. By comparing the halo finders using a high-resolution cosmological volume, we found that they agree remarkably well on fundamental properties of astrophysical significance (e.g. mass, position, velocity and peak of the rotation curve). We further suggest to utilize the peak of the rotation curve, vmax, as a proxy for mass, given the arbitrariness in defining a proper halo edg
Heritage
Quarterly publication containing articles related to the preservation of historic artifacts and sites in Texas. Feature articles discuss various aspects of Texas history and heritage, often highlighting museums and collections within the state. Also included are book reviews, current preservation news, and a listing of historical museums in Texas
Recommended from our members
Scalable Contour Tree Computation by Data Parallel Peak Pruning.
As data sets grow to exascale, automated data analysis and visualization are increasingly important, to intermediate human understanding and to reduce demands on disk storage via in situ analysis. Trends in architecture of high performance computing systems necessitate analysis algorithms to make effective use of combinations of massively multicore and distributed systems. One of the principal analytic tools is the contour tree, which analyses relationships between contours to identify features of more than local importance. Unfortunately, the predominant algorithms for computing the contour tree are explicitly serial, and founded on serial metaphors, which has limited the scalability of this form of analysis. While there is some work on distributed contour tree computation, and separately on hybrid GPU-CPU computation, there is no efficient algorithm with strong formal guarantees on performance allied with fast practical performance. We report the first shared SMP algorithm for fully parallel contour tree computation, with formal guarantees of O(lg V lg t) parallel steps and O(V lg V) work for data with V samples and t contour tree supernodes, and implementations with more than 30Ă parallel speed up on both CPU using TBB and GPU using Thrust and up 70Ă speed up compared to the serial sweep and merge algorithm